Analysis of Pinaceae gene family evolution suggests accelerated gene turnover in pine trees


Claudio Casola 

Department of Ecosystem Science and Management 


Duplicated genes are common in genomes

[Figure: bar chart of gene counts per genome; y-axis "# genes", 0 to 40,000. Dark color = all genes; light color = duplicated genes.]





















Comparative genomics in forest trees 


"The understanding of gene function in forest tree species is 
challenged by the lack of standard reverse-genetic tools 
routinely used in other systems [..] and by the long generation 
times. Thus, comparative genomics becomes the more 
powerful approach to understanding gene function in trees." 

Neale et al. "Open access to tree genomes: the path to a better
forest." Genome Biology, 2013

Gene turnover: the combination of gene duplications and gene losses
in a given lineage through time, that is, the change in gene family size
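
To make the definition above concrete, here is a minimal sketch that tallies changes in gene family size between an ancestral node and a descendant lineage. The function name, the `TurnoverSummary` type, and the family counts are hypothetical and chosen for illustration only; they are not taken from the talk. The sketch reports net change per family, so a duplication and a loss within the same family cancel out and true turnover may be underestimated.

```python
from typing import Dict, NamedTuple


class TurnoverSummary(NamedTuple):
    expanded: int      # families larger in the descendant
    contracted: int    # families smaller in the descendant
    unchanged: int     # families with the same size
    genes_gained: int  # total net gene gains across expanded families
    genes_lost: int    # total net gene losses across contracted families


def summarize_turnover(ancestral: Dict[str, int],
                       descendant: Dict[str, int]) -> TurnoverSummary:
    """Compare gene family sizes (family name -> gene count) along one branch."""
    expanded = contracted = unchanged = 0
    genes_gained = genes_lost = 0
    # A family missing from one side is treated as having zero genes there.
    for family in set(ancestral) | set(descendant):
        delta = descendant.get(family, 0) - ancestral.get(family, 0)
        if delta > 0:
            expanded += 1
            genes_gained += delta
        elif delta < 0:
            contracted += 1
            genes_lost += -delta
        else:
            unchanged += 1
    return TurnoverSummary(expanded, contracted, unchanged,
                           genes_gained, genes_lost)


# Hypothetical counts for three gene families (illustration only).
example_ancestral = {"famA": 2, "famB": 3, "famC": 1}
example_descendant = {"famA": 4, "famB": 1, "famC": 1}
summary = summarize_turnover(example_ancestral, example_descendant)
```

With these made-up counts, one family expands by two genes, one contracts by two genes, and one is unchanged.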









Gene evolution in Pinaceae

Completeness of gene annotation in seed plants

Outline


Different rates of gene turnover in Pinaceae

1. Issues with gene fragments (false gene duplications)

2. Issues with gene duplications vs. gene losses

3. Issues with species phylogeny


Rates of gene turnover in Pinaceae vs. angiosperms

Is there variation in gene turnover between angiosperms and Pinaceae?

Gene turnover in four Pinaceae

[Figure: time-calibrated phylogeny of loblolly pine, sugar pine, Norway spruce, and Douglas-fir, with internal nodes labeled Pi and PS; time axis from 200 to 0 Mya.]












Observed genes in gene families

Estimated ancestral genes in gene families

16,118 gene families

Different gene turnover rates?

Gene annotation in Pinaceae

Gene fragments in Pinaceae

Complete gene set (CGS): all genes in gene families included -> 104,058 genes

Filtered gene set (FGS): only genes longer than 75% of the longest gene in the same
family included -> 75,450 genes
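
The FGS rule above can be sketched as a simple per-family length filter. This is a minimal illustration, not the pipeline used in the talk: the function name, the input dictionaries, and the gene lengths are hypothetical, and the slide does not say whether length is measured on proteins or coding sequences, so the sketch accepts any consistent unit.

```python
from collections import defaultdict
from typing import Dict, Set


def filter_gene_set(gene_lengths: Dict[str, int],
                    gene_to_family: Dict[str, str],
                    min_fraction: float = 0.75) -> Set[str]:
    """Keep genes longer than min_fraction of the longest gene in their family."""
    # First pass: find the longest gene in each family.
    longest = defaultdict(int)
    for gene, length in gene_lengths.items():
        family = gene_to_family[gene]
        longest[family] = max(longest[family], length)
    # Second pass: keep genes strictly above the family-specific threshold,
    # matching the "longer than 75%" wording of the slide.
    return {
        gene
        for gene, length in gene_lengths.items()
        if length > min_fraction * longest[gene_to_family[gene]]
    }


# Hypothetical lengths and family assignments (illustration only).
example_lengths = {"g1": 1000, "g2": 800, "g3": 700, "g4": 300}
example_families = {"g1": "F1", "g2": "F1", "g3": "F1", "g4": "F2"}
filtered = filter_gene_set(example_lengths, example_families)
```

In this made-up example, `g3` is dropped because 700 is not above 75% of the longest gene in family `F1` (1000), while the single-gene family `F2` always keeps its only member. Filtering this way is meant to remove fragmented annotations that would otherwise be counted as extra gene copies.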









Gene turnover in four Pinaceae

Models of gene turnover evolution

Best model for CGS

Best model for FGS

Higher rates of gene turnover in pine trees for both gene sets

Removing possible biases due to duplication vs. loss of genes

FGS: 75,450 genes

Removing possible biases due to the number of pine species

Outline


Rates of gene turnover in Pinaceae vs. angiosperms

Is there variation in gene turnover between angiosperms and Pinaceae?

Gene turnover in seed plants

Gene turnover in Pinaceae is higher than in other eukaryotes

Gene turnover in pine trees is higher than in angiosperm
lineages with no whole-genome duplication (WGD)

Contrasting Rates of Molecular Evolution and Patterns of
Selection among Gymnosperms and Flowering Plants

Amanda R. De La Torre, Zhen Li, Yves Van de Peer, and Pär K. Ingvarsson


Gymnosperm rates of molecular evolution per unit time are, on average,
seven times lower than those of angiosperm species

"Longer generation times and larger genome sizes are some of the factors
explaining the slow rates of molecular evolution found in gymnosperms."

Contrasting rates of gene turnover and nucleotide
substitutions among Pinaceae and flowering plants

Conclusions


Gene turnover is accelerated in pine trees compared with the two other
Pinaceae analyzed (Norway spruce and Douglas-fir)

This pattern is robust to potential biases due to the presence of gene
fragments, gene duplications vs. gene losses, and the presence of
multiple pine species in the phylogeny

Gene turnover may be accelerated in conifers relative to angiosperm
lineages that did not experience whole-genome duplications